Unfolding accessibility provides a macroscopic approach to temporal networks 
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An accessibility graph of a network contains a link, wherever there is a path of arbitrary length 
between two nodes. We generalize the concept of accessibility to temporal networks. Building an 
accessibility graph by consecutively adding paths of growing length (unfolding), we obtain infor- 
mation about the distribution of shortest path durations and characteristic time-scales in temporal 
networks. Moreover, we define causal fidelity to measure the goodness of their static representation. 
The practicability of our proposed methods is demonstrated for three examples: networks of social 
contacts, livestock trade and sexual contacts. 



PACS numbers: 89.75.Hc, 89.75.Fb, 05.90. +m 

Many real world systems can be described in terms of 
networks [lj, i.e. sets of nodes, or vertices, and edges con- 
necting node pairs. Prominent examples are social con- 
tacts, flow of people between cities, trade between coun- 
tries or the world wide web [2HB]. In many cases, a static 
network description is sufficient: an edge is taken present, 
whenever a connection between the corresponding nodes 
was established during the whole data acquisition time. 
Mathematically, static networks are considered as graphs 
and can be represented by adjacency matrices. These 
contain unit elements in the entries corresponding to the 
indices of nodes connected by an edge, all other elements 
being zeroes. This representation is used in a large num- 
ber of algebraic methods for network analysis [7, 8J. 

Every route across the network connecting two vertices 
along the network's edges is called a path. For a macro- 
scopic view of a network, it is often sufficient to know, 
whether the nodes are connected by a path of a whatever 
length, i.e. whether one node is accessible starting from 
another one. Accessibility can be mapped onto a single 
mathematical object, the accessibility matrix of the net- 
work, containing unit entries for indices corresponding 
to the pairs of the nodes connected by paths of arbitrary 
length [8]. This matrix is the adjacency matrix of 
the accessibility graph. To build this graph, one starts 
with the network itself and first adds edges between the 
nodes connected by paths of length 2. Then one subse- 
quently adds longer paths as edges step-by-step (fig. [I]), 
until there are no more new paths to add. It turns out 
that valuable insights into the network structure are not 
only given by the accessibility graph itself, but also by 
the graphs obtained on intermediate stages of its step-by- 
step construction. We call this step-by-step construction 
the unfolding of the accessibility graph. 




Figure 1: Unfolding of an accessibility graph. Starting with 
a network (left) we unfold its accessibility graph by adding 
direct edges for paths of length 2 (center) and so forth. The 
fully unfolded accessibility graph is shown on the right. 



Although the concept of accessibility is plausible in 
static networks, it is still not clear how to generalize it 
to networks, where edges are not constant over time. 
Many systems have to be represented as temporal net- 
works: their nodes being fixed, but edges, representing 
connections between them, are only present during cer- 
tain periods of time [9-1 3J, see [14J for a review. Tem- 
poral networks are not graphs and cannot in general be 
represented by single matrices. Therefore, most atten- 
tion on temporal networks has focussed on data-driven 
approaches [lOj [12j [15]. Work concerning macroscopic 
properties of temporal networks is rare. This is partly 
due to the absence of an established formalism to treat 
these objects [H|HS]. A time- aggregated, static represen- 
tation of a temporal network may contain paths, which 
do not follow a causal sequence of connections between 
nodes, i.e. paths which cannot be taken in a real sys- 
tem. A static representation may be not sufficient for 
the description of a temporal network. 

The first step for the introduction of a formalism for 
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temporal networks will thus be taking into account the 
causality of paths. In the present Letter we introduce 
accessibility graphs of temporal networks and use them 
for a description of their macroscopic properties. In ad- 
dition, we define the measure of causal fidelity quantify- 
ing the goodness of a static representation of a temporal 
network. We first briefly review the basic concept of ac- 
cessibility in static networks and introduce the method 
of unfolding accessibility for the static case. Then we 
discuss generalizations of the concept of accessibility to 
temporal networks. Finally, we analyze three different 
data sets with respect to their characteristic time scales 
and compare their temporal and static representations. 

Let us consider a static network G = (V, E), where V is 
the set of vertices and E is the set of edges. The network 
is represented by an adjacency matrix A, with (A) uv = 1, 
if there is an edge between nodes u and v and (A) uv = 0, 
otherwise. The accessibility graph (transitive closure) of 
a static network is denoted by G* = (V,E*). It contains 
an edge (u,v) G E*, if a path of arbitrary length exists 
between the nodes u and v. Given an adjacency matrix 
A, the elements of the matrix A n are the numbers of 
paths of length n connecting pairs of nodes of the graph. 

Consequently, the cumulative matrix C n = A+A 2 H h 

A n has a nonzero entry (C n ) uv ^ 0, if the nodes u and v 
are connected by a path of whatever length smaller than 
or equal to n. Summing up all powers of the adjacency 
matrix, i.e. building a matrix C n= N = 

]T-Li A * ( where 

N is the total number of nodes in the network), is related 
to the transitive closure of the network. 

Provided that we are interested only in the number of 
nodes that can be reached from each initial node within 
n steps, we record whether the elements of C n are zero 
or not. Changing every nonzero element of C n to unity 
yields the accessibility matrix P n . The matrix P n (with 
binary entries) can be treated as a Boolean matrix. Al- 
ternatively, one can consider the adjacency matrix A as 
a Boolean matrix from the very beginning. The prod- 
uct of two Boolean matrices is defined by using normal 
algebra, but Boolean arithmetics, interpreting multipli- 
cation as a logical AND (A) operation and addition as a 
logical OR (V) operation. The product of two Boolean 
matrices is (AB)^- = \/ k=1 a%k Abkj. In this notation, the 
accessibility matrix P n of path length n becomes [17] 



Pn = \/ A< > 



(1) 



where the i-th power of the boolean matrix A is com- 
puted using boolean matrix products. A brief discus- 
sion of some properties of the fully exploited accessibility 
graph P n =Ar is given in the supplementary information 
[15] . The number of edges in Pi gives the number of 
paths of length 1, and at every step n — >• n+1 the network 
is traversed by paths with one more edge. Thus, Eq. 
corresponds to a breadth-first-search (BFS) in the net- 



work. This relation is also used in the Floyd- Warshall 
algorithm [T71 [19j for the computation of shortest path 
lengths. When the BFS reaches the diameter D of the 
network, i.e. there are no more shortest paths to traverse, 
the accessibility matrix saturates, Pat = Pd. 

The total number of nonzero elements of the accessi- 
bility matrix P n is associated with the distribution of 
shortest path lengths. We define the density p(A) of an 
N x N matrix A with N 2 entries and nnz(A) nonzero 
elements as 



nnz(A) 
N 2 ' 



(2) 



When A is an adjacency matrix, p(A) defines the edge 
density of the network, giving the probability to find an 
edge between two nodes chosen at random. In analogy, 
the probability to find a path of length I < n between two 
randomly chosen nodes in a connected network is given 

by 



F n = P(l <n) = p(P n ). 



(3) 



Consequently, the path density p(P n ) gives the cumula- 
tive distribution of shortest path lengths in the network. 
The corresponding probability to find the shortest path 
of length n is given by f n (F n - F n _i), with F 0. 
The distribution saturates, when n reaches the diame- 
ter D of the network. For a single-component network 
Ed = p(Pd) — 1- For networks comprised of disjoint 
components p(Pd) < 1? since their accessibility matri- 
ces have less than N 2 nonzero entries [18J. In this case 
we consider F n as an "improper" probability distribution, 
which is not normalized to unity, and define its median 
as the value of n, where F n approaches one half of its 
saturation value Fjj. 

Temporal networks allow edges to vary over time, and 
are represented by a sequence of adjacency matrices A = 
{Ai, A2, . . . , At}, where each matrix A^ is a snapshot 
of the system at time t. The increment corresponds to 
the temporal resolution and T is the maximal time given 
in the data. The accessibility graph Q* = (V,E*) of a 
temporal network has to allow for only causal paths (time 
respecting paths), in which the temporal order of edges 
has to be taken into account: it contains an edge (u, v) G 
E* , if and only if there is a causal path between the nodes 
u and v. A time-aggregated, static representation follows 
by summing up all (Boolean) matrices in the sequence, 
A = V^ =1 A t . 

As we have seen for the static case, the product of two 
identical adjacency matrices gives information about the 
number of paths of length two. If we multiply different 
matrices, i.e. two matrices Ai and A2 in the sequence A 
we obtain non-zero entries, wherever nodes receive edges 
at time 1 and cast forth edges at time 2. It is impor- 
tant to emphasize that the entries (AiA2)ij vanish, if 
an edge received by node i at time 1 is not cast forth 
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to node j at exactly time 2. Using products of differ- 
ent adjacency matrices in the sequence A, we can derive 
a dynamic accessibility matrix exactly as it was done in 
the static case. This straight-forward approach, however, 
does not give useful results, because it does not include 
the possibility of node waiting times (see supplementary 
information |18|). 

To overcome this drawback, we allow for temporal 
shortcuts in the accessibility matrix by taking into ac- 
count products of adjacency matrices corresponding to 
remote time steps. As an example, if many nodes in the 
system receive links at time 5 and cast forth links at time 
100, the product A5A100 should make a large contribu- 
tion to the accessibility matrix. To introduce shortcuts, 
we define the accessibility matrix of a temporal network 
by adding an identity matrix 1 to each matrix in the 
sequence. This yields all ordered products of snapshot 
matrices in time: C n = III!=i(l + ^). The addition of 
an identity matrix can be interpreted as the ability of 
each node to keep edge information over time, i.e. it 
introduces memory into the system. 

Since we are not interested in the actual number of 
paths between two nodes, we can either set any nonzero 
element of C n to unity or, equivalently, use the Boolean 
formulation 



P n = /\(lVAi) 



(4) 



=1 V Ai V A 2 V A 3 V • • • V A n V 

V Ai A 2 V A 2 A 3 V Ai A 3 V Ai A 2 A 3 V • • • . 

Linear terms in Q correspond to the aggregated net- 
work. Higher order products (always respecting a causal 
sequence of indices) represent all possible direct connec- 
tions or temporal shortcuts. It should be noted that, 
due to the causality of the paths, the temporal accessi- 
bility graph of an undirected network is in general di- 
rected. The unfolding of the accessibility matrix given 
in Q can now be used to compute shortest-path distri- 
butions F n for temporal networks in the same manner 
as for the static case. Note that the index n has now 
the meaning of the duration of the path (total number 
of time steps necessary to traverse the system, regarding 
waiting times). It defines the time (in units determined 
by the temporal resolution of the used data) which, say, 
the passengers spent in motion. Thus, a temporal net- 
work may still show a small-world property as long as its 
time-aggregated representation is a small-world network. 
However, if the the links between nodes are activated 
only during distant periods in time, the spread over the 
network would be slow: even a small world can be a "slow 
world". 

We have to point out that the identity matrix on the 
right-hand side of Q is an artifact of the introduced 
memory term. It does not make any difference in undi- 
rected networks, whose accessibility matrices always ex- 



hibit non- vanishing diagonal entries for n > 2 [18J, but 
may make a difference in directed networks. However, 
since the number of diagonal elements is small compared 
to the total number of elements of a large matrix, this 
deviation is negligible in large networks. 

Temporal networks are often approximated by their 
time-aggregated counterparts. However, some paths in a 
static network representation do not follow a chronolog- 
ical order and are never taken. In order to quantify the 
quality of a static representation, we define the causal fi- 
delity c as the fraction of the number of paths in a static 
network which can be also taken in a temporal one: 



p(Pt) 
p(Pt)' 



(5) 



The values of c lay in the interval < c < 1, where c = 1 
means that the path density of a static representation 
does not differ from the temporal system. Low values 
of c indicate that the majority of paths in the static ap- 
proximation do not follow a causal sequence of edges and 
thus are not present in the temporal one. 

As examples we consider several real-world networks 
showing different time scales ranging from three days [20] 
to six years [21J, see Table [l| 

Table I: Temporal network data sets. 
Data set size snapshots t-res. type source 



Conference 113 
Pig trade 89745 
Sexual contacts 16730 



10618 20 s undirected [20] 
465 1 d directed [22] 
2232 1 d undirected I2T1 



a www. hi- tier.de. Data access restricted to competent authorities 
in Germany. 

We unfold the accessibility graphs of those networks 
and obtain path densities and distributions of shortest 
path durations. Hereby, we treat the path density as 
an (improper) cumulative distribution. The results are 
shown in figures |2j [3] and [4] 

Figure [2] shows results for a network of conference at- 
tendees. Every face-to- face contact between two persons 
is recorded as an undirected edge. The data set covers 
a time span of 3 days, separated by time spans of no 
activity (nights). The density of the accessibility graph 
in figure [2] shows clear and fast saturation at a value of 
p = 0.9917, indicating that after only 3 days a large part 
of the network is causally connected. More than 70% 
of the possible paths are traversed within only one day 
(black line) . The median of the distribution is reached in 
less than 6 hours. An aggregated network representation 
gives a good approximation of the system, which is also 
reflected in the high causal fidelity of c « 0.99 of its static 
representation. 

Figure [3] shows the results for the network of pig trade 
in Germany over a time span of 455 days. Every node 
in the network is an agricultural holding and an edge is 
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Figure 2: (Color online) Density of the accessibility graph 
(black line) and distribution of shortest path durations (red) 
for a network of face-to-face contacts at a conference. 



present, when a holding sells livestock animals to another 
one. In contrast to the two other data sets, the pig trade 
network is directed. This is reflected in relatively low 
values of the path density. The shortest path lengths 
show a broad distribution (red). Although the temporal 
diameter of the pig trade network is by far larger than the 
observation period of 455 days, the distribution shows a 
global maximum (i.e. possesses a mode) at around 120 
days. This means that the typical spreading time-scales 
are of the order of 100 days. The comb-like structure 
of the graph reflects the time-inhomogeneity of trade, 
which is not so frequent on Saturdays and almost absent 
on Sundays. The static representation of the network 
captures its features sufficiently well (c = 0.72) and can 
be in principle used in epidemiological modeling. The 
same behavior can be expected for pig trade networks of 
other countries, because it is determined by the typical 
lifetime of a livestock pig. 



Pig trade 
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fleets physical contacts between prostitutes and their cus- 
tomers. Each contact is recorded as an undirected link. 
Here, no saturation of the path density is reached within 
the study period, and the distribution of shortest path 
durations does not show a clear maximum. The figure 
demonstrates that the majority of shortest paths takes 
more than 2 years to traverse. Even if time-scales cannot 
be resolved from the data, the observation is useful to 
define a minimum duration of future field observations 
and may be of use for the implementation of vaccina- 
tion strategies. The figure shows that it takes at least 3 
years to obtain a considerable density of the accessibility 
graph. The sexual contact network is only poorly repre- 
sented by its aggregated counterpart, as reflected by low 
causal fidelity of this representation (c = 0.38). A static 
network would significantly overestimate the number of 
transmission paths, and the system has to be analyzed 
from a temporal network perspective [21J. 

Sexual contacts 



4x10" 



_Q 
03 

_Q 
O 




Tl 

CD 



CD- 
CD 
ZD 

ifi 



100 200 300 400 500 

Shortest path duration (days) 



1 2 3 4 5 6 

Shortest path duration (years) 

Figure 4: (Color online) Density of the accessibility graph 
(black line) and distribution of shortest path durations (red) 
for a network of sexual contacts. 



To conclude, we introduced temporal accessibility 
graphs as a formalism to analyze temporal network data 
in terms of causal paths. The method is the temporal 
generalization of the transitive closure of a static net- 
work. It gives information about the distribution of time- 
scales in the system and also provides an instrument for 
quantifying the quality of their simpler, static represen- 
tation. The examples of real networks considered demon- 
strate the practicability and usefulness of this approach. 

This research was funded by the Federal Ministry of 
Education and Research, grant 13N9520. 



Figure 3: (Color online) Density of the accessibility graph 
(black line) and distribution of shortest path durations (red) 
for a network of pig trade in Germany. 



Figure [4] shows the distribution of shortest path dura- 
tions for a network of sexual contacts. The network re- 
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SUPPLEMENTARY MATERIAL 



PROPERTIES OF STATIC ACCESSIBILITY 
GRAPHS 



We briefly report some properties of the fully exploited 
accessibility graph 



N 



n=N 



\JA\ 



(6) 



2=1 



where N is the total number of nodes in the network. If 
the network is connected, Pat is a matrix with all unit 
entries, i.e. it has ^^(Pn)^ = N 2 nonzero elements. In 
networks consisting of disjoint components, P^ can be 
transformed to a block-diagonal form, with blocks filled 
with unit entries. Due to the block structure, the matrix 
has less than N 2 nonzero entries in this case. In directed 
networks, the diagonal entries of P n can be or 1 for any 
n. The diagonal entries of P n in undirected connected 
networks are unities for all n > 2, because there is always 
a path of length 2 from any node in the network back to 
itself. 

The distance between two nodes is the length of the 
shortest path connecting them. The diameter D of the 
network is the maximum distance between any two nodes 
in a network. Increasing n in equation ([6| successively 
adds longer shortest paths into P n . When n approaches 
the diameter of the network, no new nodes can be reached 
by considering longer shortest paths. Thus, the diameter 
D of a network is the minimal number so that Pd+i = 
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Figure 5: Density of the accessibility graph of a directed ran- 
dom network vs. n. The path density saturates at n — 18 
(diameter of the network). Network: 1000 nodes, 2000 di- 
rected edges. 
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Figure 6: Shortest path length distribution as it follows from 
fig. [5] Mean value = 8.18. 



STRAIGHT-FORWARD TEMPORAL 
GENERALIZATION OF STATIC ACCESSIBILITY 



EXAMPLE: UNFOLDING ACCESSIBILITY OF A 
DIRECTED STATIC ERDOS-RENYI GRAPH 

As an example, we plot the path-density p(P n ) for 
a directed Erdos-Renyi graph G e = (V,E) with 1000 
nodes and 2000 edges in figure [5] The curve saturates for 
n > 18 indicating that 18 is the diameter of the network. 
The saturation value of /9 sat (P n ) ~ 0.7 is smaller than 
unity, since the network G e is not strongly connected. 
The median of path lengths is located at n = 8, where 
the density approaches p(P n ) ~ 0.35. The correspond- 
ing shortest path length distribution f n shows that the 
shortest path length is on average much smaller than the 
number of nodes in the network (figure |5| indicating the 
small- world nature of the network. 



A straight-forward way of generalizing the accessibility 
matrix to a system of the form 

A = Ai,A 2 ,...,A T (7) 

could consist in replacing the powers of the same matrix 
A by products of different matrices A t : 

n i 

Cn = Yl II A ^ = Al + Al A2 + A l A 2 A 3 + ' • ' (8) 

i=l j=l 

and 

n i 

Vn = V A A i = Al V Al A2 V A i A 2 A 3 V • • • . (9) 

i=lj=l 

Every higher order product • • • A m in ([9| contributes 
to V ni if and only if there is a chronological sequence of 



6 



edges placed at exactly the time steps k . . .ra- 
in many systems, however, there are waiting times be- 
tween two link activations. As an example, many nodes 
of the network could receive links at time 1 and be idle 
until they cast forth links at time 3. This factor plays 
a role in transportation or information networks, where 
passengers or information are transported to a node v and 
do not disappear, but wait for one time step, and are then 
transported further at time step 3. In the mathematical 
context, this means that the product A1A2 could yield 
zeros for a row/column v, but the product A1A3 would 
not. These waiting times cannot be captured by the ap- 
proach suggested by our simple ansatz. This drawback 
is reflected in the fact that the products of higher order 
vanish in sparse networks for purely statistical reasons, 
i.e. 

n 

lim TT Ai = 0. (10) 

n— )-oo -f- 

A dilution of the temporal network occurs, when the 
intrinsic node time scale (node waiting time) is much 
larger, than the edge dynamic (temporal resolution). An 
example was reported in |10| , where the maximum path- 
length for immediately consecutive edges was found to 
be 8 (days) for a network of cattle trade. However, the 
intrinsic timescale for cattle trade is in the order of years. 
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